lenient相关论文
Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Envi
Multiagent deep reinforcement leing (MA-DRL) has received increasingly wide attention.Most of the existing MA-DRL algori......
2018年《刑事诉讼法》新增认罪认罚从宽制度,未成年被追诉人选择认罪认罚,同样可以从宽。未成年被追诉人认罪认罚“从宽”幅度上限......
研究了明代公讳,认为明代公讳相对于我国避讳历史上的汉、唐、宋、清四个高潮时期宽疏,其原因为:继承了元代避讳不严的遗风,创“五行”......